Rank in Wordlist | Frequency | Word |
---|---|---|
4168 | 4 | 0,2 |
4169 | 4 | 1,5 |
5347 | 3 | 0,4 |
5348 | 3 | 1,1 |
5377 | 3 | 2,1 |
5378 | 3 | 2,7 |
5384 | 3 | 4,5 |
5392 | 3 | 7,5 |
7404 | 2 | 1,3 |
7405 | 2 | 1,34 |
Rank in Wordlist | Frequency | Word |
---|---|---|
9843 | 2 | Sprache(Rumänisches |
14534 | 1 | Box-(N$ |
17037 | 1 | Galli(etc. |
17972 | 1 | Hand(werks-)arbeiten |
21689 | 1 | Ngarangombe(Eland |
22828 | 1 | Radsport-(N$ |
26642 | 1 | Weihnachts-(Kunst-)Handwerksmarkt |
31045 | 1 | mach(t |
31433 | 1 | o.O.(Magdeburg |
32287 | 1 | streng(er |
Rank in Wordlist | Frequency | Word |
---|---|---|
12102 | 1 | 0)64 |
12135 | 1 | 0:0)-Heimsieg |
12137 | 1 | 0:1)-Niederlage |
17389 | 1 | Geschoss)Kugeln |
17972 | 1 | Hand(werks-)arbeiten |
21490 | 1 | Namibia-)Deutschen |
22439 | 1 | Planeten)-Kampagne |
23825 | 1 | Schul-)Bildung |
26474 | 1 | Wahl-)Heimat |
26642 | 1 | Weihnachts-(Kunst-)Handwerksmarkt |
Rank in Wordlist | Frequency | Word |
---|---|---|
5398 | 3 | 8% |
7499 | 2 | 51% |
12106 | 1 | 0,2% |
12109 | 1 | 0,3% |
12116 | 1 | 0,9% |
12118 | 1 | 0.3% |
12140 | 1 | 1% |
12147 | 1 | 1,4% |
12160 | 1 | 10% |
12195 | 1 | 12% |
Rank in Wordlist | Frequency | Word |
---|---|---|
7768 | 2 | Bed & Breakfast |
12100 | 1 | & Co |
13026 | 1 | Adam & Eva |
13838 | 1 | B&B-Unterkünfte |
16266 | 1 | F&H |
17043 | 1 | Galz&Goals |
18248 | 1 | Herrle&Herma |
21843 | 1 | O&L |
21844 | 1 | O&L Leisure |
21845 | 1 | O&L- |
Rank in Wordlist | Frequency | Word |
---|---|---|
150 | 111 | N$ |
14534 | 1 | Box-(N$ |
21298 | 1 | N$. |
21299 | 1 | N$12,50 |
21300 | 1 | N$60 |
22828 | 1 | Radsport-(N$ |
25605 | 1 | U$ |
Rank in Wordlist | Frequency | Word |
---|---|---|
1924 | 10 | gibt's |
5303 | 4 | war's |
9191 | 2 | N/a'an |
9391 | 2 | Paul's |
9852 | 2 | St. Paul's |
11051 | 2 | geht's |
13283 | 1 | Amphi's |
14522 | 1 | Boulder-size'-Klasten |
14999 | 1 | Clarissa's |
15122 | 1 | D'Almeida |
Rank in Wordlist | Frequency | Word |
---|---|---|
3602 | 5 | Google + |
Rank in Wordlist | Frequency | Word |
---|---|---|
2939 | 6 | 2013/14 |
3056 | 6 | HIV/Aids |
4860 | 4 | Windhoek/Swakopmund |
6277 | 3 | Reuters/ech |
6411 | 3 | Swakopmund/Walvis |
6585 | 3 | Windhoek/Berlin |
6586 | 3 | Windhoek/Otjiwarongo |
7254 | 3 | und/oder |
9191 | 2 | N/a'an |
9228 | 2 | Nampa/ech |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots